Computational requirement of schema matching algorithms

نویسندگان

  • PETER MARTINEK
  • BELA SZIKORA
چکیده

The integration of different data structures e.g. relational databases of information systems is a current issue in the area of information sciences. Numerous solutions aroused recently aiming to achieve a high accuracy in similarity measurement and integration of schema entities coming from different schemas. Researches usually properly evaluate the capabilities of these approaches from the point of view of accuracy. However the computational complexity of the proposed algorithms is hardly ever examined in the most of these works. We claim that efficiency of a solution can only be judged by taking into account both the accuracy and the computational requirements of participating algorithms. Since there are many known measurement methods and metrics for the evaluation of accuracy, the focus is set for the analysis of their computational complexity in this paper. After the problem formulation the main ideas behind our method are presented. Various approximation techniques and methods of applied algorithm theory are used to evaluate the different approaches. Three specific approaches were also selected to present the work of our method in details on them. Experiments run on several test inputs are also included. Key-Words: Computational complexity, Schema matching, Approximation techniques in computational requirement estimation

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Computational complexity of schema matching approaches

Comparing and integrating of different data structures e.g. relational databases of information systems is a current problem in information sciences. Various solutions have appeared in the last 10 years aimed to achieve a high accuracy level in schema integration and similarity measurement of entities originating from different schemas. The capabilities of approaches are usually properly evalua...

متن کامل

An Improved Semantic Schema Matching Approach

Schema matching is a critical step in many applications, such as data warehouse loading, Online Analytical Process (OLAP), Data mining, semantic web [2] and schema integration. This task is defined for finding the semantic correspondences between elements of two schemas. Recently, schema matching has found considerable interest in both research and practice. In this paper, we present a new impr...

متن کامل

New Challenges in Data Integration: Large Scale Automatic Schema Matching

Today schema matching is a basic problem in almost every data intensive distributed application, namely enterprise information integration, collaborating web services, ontology based agents communication, web catalogue integration and schema based P2P database systems. There has been a plethora of algorithms and techniques researched in schema matching and integration for data interoperability....

متن کامل

New Challenges : Large Scale Automatic Semantic Integration

Today schema matching is a basic problem in almost every data intensive distributed application, namely enterprise information integration, collaborating web services, ontology based agents communication, web catalogue integration and schema based P2P database systems. There has been a plethora of algorithms and techniques researched in schema matching and integration for data interoperability....

متن کامل

PLASMA: A Platform for Schema Matching and Management

This paper introduces an XML Schema management platform that promotes the use of matching techniques to fulfill the requirements of data integration and data exchange. The existing platforms, in the market, deal only with graphical but not automatic matching. Several matching algorithms were suggested, by different researchers, to automate the correspondences discovery between XML Schemas. Thes...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009